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(57) Abstract 



Methods for determining an optimal binding motif for a binding compound are provided in which the binding compound is contacted 
with an oriented degenerate cyclic peptide library (ODCPL) under conditions which aUow for interaction between the binding compound and 
the ODCPL such that a complex is formed between the binding compound and a subpopulation of library members capable of interacting 
with the binding compound. The subpopulation of library memben capable of interacting with the binding compound is then separated 
from library members that are incapable of interacting with the binding compound. The subpopulation of library members capable of 
interacting with the binding compound is linearized to fonn a subpopulation of linearized library members. The amino acid sequence of the 
subpopulation of linearized library members is determined and an amino acid sequence motif is then determined for an interaction site of 
the binding compound, based upon the relative abundance of different amino acid residues at each degenerate position within the linearized 
library members. Oriented degenerate cyclic peptide libraries, and methods for purifying cyclic peptides from linear peptides, are also 
provided. 



♦(Refcired to in PCT Gazette No. 1 3/1 999. Section D) 



FOR THE PURPOSES OF INFORMATION ONLY 

Codes used to identify States party to the PCT on the front pages of pamphlets publishing international applications under the PCX. 



,AL 


Albania 


ES 


Spain 


LS 


Lesotho 


SI 


Skyveoia 


AM 


Armenia 


Fl 


Finland 


LT 


T.ifh|nm<ii 


SK 


Stovakia 


AT 


Austria 


FR 


France 


LU 


Luxembourg 


SN 


Senega] 


AU 


Australia 


GA 


Gabon 


LV 


Latvia 


sz 


Swaziland 


AZ 


Azeibaijan 


GB 


United Kmgdom 


MC 


Monaco 


TD 


Chad 


BA 


Bosnia and Henegovina 


GE 


Georgia 


MD 


Republic of Moldova 


TG 


Togo 


BB 


Baitados 


GH 


Ghana 


MG 


Madagascar 


TJ 


Tajikistan 


BE 


Belgium 


GN 


Guinea 


MK 


The former Yugoslav 


TM 


Turkmenistan 


BF 


Buricina Faso 


GR 


Greece 




Republic of Macedonia 


TR 


Turkey 


BG 


Bulgaria 


HU 


Hungary 


ML 


Mali 


TT 


Trinidad and Tobago 


BJ 


Benin 


IE 


Ireland 


MN 


Mongolia 


UA 


Ukraine 


BR 


Brazil 


IL 


Israel 


MR 


Mauritania 


UG 


Uganda 


BY 


Belarus 


IS 


Iceland 


MW 


Malawi 


US 


United States of America 


CA 


Canada 


IT 


Italy 


MX 


Maioo 


UZ 


Uzbekistan 


CF 


Central African ReimbUc 


JP 


Ji^an 


NE 


Niger 


VN 


Viet Nam 


CG 


Congo 


KE 


Kenya 


NL 


Netherlands 


YU 


Yugoslavia 


CH 


Swkzeriand 


KG 


Kyrgyzstan 


NO 


Norway 


zw 


Zimbabwe 


a 


COte d'lvoire 


KP 


Democratic People's 


NZ 


New Zealand 






CM 


Caineiooa 




Republic of Korea 


PL 


Poland 






CN 


China 


KR 


Republic of Korea 


PT 


Portugal 






CU 


Cuba 


KZ 


Kazakstan 


RO 


Romania 






CZ 


Czech Republic 


LC 


Saint Lucia 


RU 


Russian Federation 






D£ 


Gexmany 


LI 


Liechtenstein 


SD 


Sudan 






DK 


Denmark 


LK 


SriLaidca 


SE 


Sweden 






EE 


Estonia 


LR 


Liberia 


SG 


Singapore 







wo 98/54577 



PCT/US98/10876 



CYCLIC PEPTIDE LIBRARIES AND METHODS OF USE THEREOF 
TO IDENTIFY BINDING MOTIFS 

Background of the Invention 

5 Many biological responses are mediated by the interaction of a binding protein 

with a target compound. Examples of such binding interactions include the interaction 
of an enzyme with its substrate (e.g., the interaction of kinases, proteases, phosphatases 
etc, with their substrates), the interaction of antibodies with antigens, the interaction of 
receptors with ligands and the interaction of SH2 domains with phosphotyrosine- 

10 containing targets. The specificity of a particular binding protein, such as an enzyme, 
typically has been determined by identifying a number of natural substrates for the 
protein, obtaining the sequence of these substrates and then comparing the sequences of 
these substrates to define a consensus motif for substrate binding. 

For example, information on the specificity of protein kinases has typically come 

15 from locating the phosphorylation sites on in vivo and/or in vitro substrates of the kinase 
and determining a consensus motif for the phosphorylation sites by comparing the 
sequences of the substrates, (See e.g., Taylor et al,, (1990) Ann. Rev. Biochem. 59:971- 
1005; Cheng, et al. (1991) J. Biol. Chem. 266:17919-17925; Walsh et al. (1990) in 
Peptides and Protein Phosphorylation. (B.E. Kemp, ed.) CRC Press Inc., pp. 43-84; 

20 Gaehlen and Harrison (1990) in Peptides and Protein Phosphorylation. (B.E. Kemp, ed.) 
CRC Press Inc., pp. 239-254). These types of studies have demonstrated the importance 
of the primary amino acid sequence around the site of phosphorylation in determining 
the in vivo specificity of protein kinases. Once a binding motif has been defined in this 
manner, synthetic peptides have been constructed based upon the consensus sequence 

25 motif and individual amino acids have been replaced one by one in order to determine 
the importance of particular amino acids on the Ki^ or V^j^ of the phosphorylation 
reaction. 

There are severe limitations, however, to this approach for determining the 
specificity of a binding protein. The procedure is quite expensive and laborious since 

30 each amino acid residue within a putative binding motif must be altered individually to 
evaluate its importance. Moreover, this approach does not necessarily identify all the 
residues critical for substrate specificity. Furthermore, an optimal substrate sequence is 
not likely to be determined unless each residue is changed to every other possible amino 
acid residue individually and then evaluated. For example, based upon an estimation of 

35 9 to 12 amino acid residues of a substrate contacting the active site cleft of a kinase there 
would be approximately 1.024 x 10^^ (20^^) distinct peptides to consider. Moreover, in 
many cases this approach is not feasible because in vivo substrates for the binding 
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protein cannot be determined with certainty. For example, if an extracellular signal 
activates multiple kinases which phosphorylate multiple substrates or if a signal 
activates a cascade of kinases it is difficult to determine which kinase phosphorylates 
which substrate. An even more difficult problem in determining the substrate specificity 
5 of a particular binding protein, especially enzymes, is that their critical in vivo substrates 
are often proteins which are present in vivo in very low abundance and thus are not 
easily detectable by in vivo assays. 

Thus, alternative methods to that of isolating and examining the sequences of 
native substrates are desirable for determining the substrate specificity of a binding 

1 0 protein. Songyang et al {Cell (1993) 72\161'11%) describe a method for determining 
the sequence specificity of the peptide-binding sites of SH2 domains using oriented 
degenerate phosphopeptide libraries. In this approach a library of linear peptides 
containing a fixed phosphotyrosine residue is used to select the optimal phosphopeptide 
substrates for a particular SH2 domain. A similar approach has been applied to the 

15 determination of optimal substrates for protein kinases (see U.S. Patent No. 5,532,167). 
These methodologies utilized linear peptide libraries composed entirely of naturally- 
occurring a-amino acids. One drawback to use of such linear, natural peptides is that 
they may not appropriately mimic the conformation of a target binding site within a 
larger protein substrate, due to the high flexibility of linear peptides in solution. 

20 Moreover, linear peptides composed entirely of natural amino acids are highly 

susceptible to degradation by proteases in vivo, which could limit the ability to use such 
peptides as pharmacological agents. Accordingly, new types of libraries and improved 
methods for identifying binding motifs for binding compounds are needed. . 

25 Summary of the Invention 

This invention provides improved methods and compositions for identifying 
binding motifs for binding compounds, which utilize cyclic peptides composed of 
natural and/or unnatural a-amino acids. The methods and compositions of the mvention 
allow for determination of an optimal binding motif for a binding compound within the 

30 context of a cyclic peptide. Cyclic peptides comprising these optimal motifs then can be 
used as pharmalogical agents to modulate the biological activity of the binding 
compound. The advantages of the methods and compositions of the invention include 
that since the cyclic peptides exhibit constrained flexibility compared to linear forms, the 
cyclic peptides are more likely to mimic the conformation of a target bmding site within 

35 a larger protein substrate. It has been shown that cyclic and linear peptides can have 

similar binding motifs and yet interact with binding proteins with different affinities, due 
to differences in structure of the peptide resulting firom cyclization. Since sequence and 
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Structure play a role in determining the optimal binding motif, the optimal sunino acid 
sequence determined with a cyclic peptide library can differ from the optimal sequence 
obtained when a linear peptide library is allowed to interact with the binding compound. 
Cyclic peptides comprising these optimal motifs then can be used as 
5 pharmacological agents to modulate the biological activity of the binding compound. 
Furthermore, cyclic peptides are more resistant to degradation compared to linear forms 
and thus are more amenable to pharmaceutical use. 

Cyclic peptides comprising these optimal binding motifs can be used to design 
pharmacological agents to modulate the biological activity of the binding compound. 

10 Since the optimal amino acid sequence for cyclic library binding may differ from that for 
linear libraries, this approach allows for the development of pharmacological agents that 
would never have been identified using a linear library technique. 

The methods of the invention for identifying binding motifs for binding 
compounds can be applied to essentially any compound that has afFmity for cyclic 

15 peptides. Non-limiting examples of binding compoimds that can be analyzed according 
to the methods of the invention include kinases (e.g., protein serine/threonine kinases, 
protein tyrosine kinases, lipid kinases), phosphatases (e.g., protein phosphatases, lipid 
phosphatases), proteases (e.g., serine proteases, cysteine proteases), SH2 domains, SH3 
domains, antibodies, WW domains, PTB domains, PDZ domains, LIM domains, 

20 pleckstrin homology domains, zinc finger domains, extracellular growth factors and 
receptors, adhesion molecules, intercellular signaling molecules, 7-transmembrane 
receptor proteins, ion channels, methyltransferases, ubiquitinating enzymes and peptidyl- 
transferases. 

One aspect of the invention pertains to a method for determining an amino acid 
25 sequence motif for an interaction site of a binding compoimd. The method involves 
contacting an oriented degenerate cyclic peptide library (ODCPL) v^th a binding 
compound under conditions which allow for interaction between the binding compoimd 
and the ODCPL. The ODCPL includes library members having an identifiable amino 
acid residue at a fixed non-degenerate position. The binding compound interacts with 
30 the ODCPL such that a complex is formed between the binding compound and a 
subpopulation of library members capable of interacting with the binding compound. 
The subpopulation of library members capable of interacting with the binding compound 
is then separated from library members that are incapable of interacting with the binding 
compound. The subpopulation of library members capable of interacting with the 
35 binding compound is linearized to form a subpopulation of linearized library members. 
The amino acid sequence of the subpopulation of linearized library members is 
determined. An amino acid sequence motif is then determined for an interaction site of 
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the binding compound, based upon the relative abundance of different amino acid 
residues at each degenerate position within the linearized library members. 

Another aspect of the invention pertains to oriented degenerate cyclic peptide 
libraries (ODCPL). In a preferred embodiment, the ODCPL comprises cyclic peptides 
5 having the formula: 




Zaa is a non-degenerate natural or unnatural a-amino acid. X^a is any natural or 

10 unnatural a-amino acid. R-R' is a dipeptide specifically cleavable under cleavage 
conditions to allow for linearization of the peptide, n and m are each independently 
selected from 0-10 inclusive, with the proviso that if n is 0, m is selected from 1-10 
inclusive and if m is 0, n is selected from 1-10 inclusive. 

The invention still further provides a method for purifying cyclic peptides from 

1 S linear peptides. The method involves first providing a mixture of cyclic peptides and 
linear peptides. The mixture is then contacted with a blocking agent that reacts with the 
free amino termini of the linear peptides to form amino-protected linear peptides. The 
mixture then is contacted with a binding agent that is capable of interacting with the 
amino-protected linear peptides but incapable of interacting with the cyclic peptides. 

20 Finally, the amino-protected linear peptides are separated from the cyclic peptides to 
thereby purify the cyclic peptides. In a preferred embodiment, the blocking agent is a 
biotin, which biotinylates the free amino-termini of the linear peptides, and the binding 
agent is a biotin-binding agent, such as avidin or streptavidin. For example, biotinylated 
linear peptides can be separated from cyclic peptides by passing the mixture over an 

25 avidin colimin. 

The invention also provides a method for determining an amino acid sequence 
motif for an interaction site of a protease. The method includes contacting an oriented 
degenerate cyclic peptide library (ODCPL) with a protease under conditions which allow 
for interaction between the protease and the ODCPL, wherein the ODCPL includes 

30 library members having an identifiable amino acid residue at a fixed non-degenerate 

position. The protease mteracts with the ODCPL such that a complex is formed between 
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the protease and a subpopulation of library members capable of interacting with the 
protease. The subpopulation of library members capable of interacting with the protease 
to form a subpopulation of linearized library members is then linearized. The amino 
acid sequence of the subpopulation of linearized library members is determined and an 
5 amino acid sequence motif for an interaction site of the protease, based upon relative 
abundance of different amino acid residues at each degenerate position within the 
linearized library members is also determined. An example of a suitable protease is 
chymotrypsin, 

10 Description of the Drawings 

Figure 1 is a schematic diagram of on-resin cyclic peptide synthesis using alpha- 
allyl esters. 

Figure 2A is an exemplary schematic diagram of separating cyclic peptides from 
linear peptides. 

1 5 Figure 2B shows a mass spectrum of a mixture of peptides. 

Figure 2C shows the mass spectrum of 2B after removal of contaminants. 
Figures 3A, 3B and 3C are bar graphs depicting the relative abundance of each 
amino acid residue at degenerate positions Tyr-2, Tyr-1 and Tyr+1, respectively, of 
peptides from members of an oriented degenerate cyclic peptide library that were 
20 cleaved by chymotrypsin. 

Figure 4 is a schematic which depicts the cleavage of a phosphorylated oriented 
degenerate cyclic peptide. 

Detailed Description of the Invention 

25 This invention pertains to methods for determining an amino acid sequence motif 

for an interaction site of a bmding compound, oriented degenerate cyclic peptide 
libraries (ODCPL) for use in such methods, and methods for purifying cyclic peptides 
from linear peptides. The invention allows for the determination of the optimal 
interaction site motif for a particular binding compound within the context of a cyclic 

30 peptide. This method has the advantage that it can be used to determine an interactive 
site for any binding compound, regardless of whether native substrates for that binding 
compound have been identified. Furthermore, since the method involves selection of 
oriented degenerate cyclic peptides which interact most readily with a binding 
compound, the amino acid sequence motif that is determined represents the optimal 

35 interactive site for that binding compound. 

Based upon the amino acid sequence motif of the interaction site of a bmding 
compound determined by the method of the invention, oriented degenerate cyclic 
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peptides and peptide libraries can be made which are substrates for that particular 
binding compound. Cyclic peptides comprising the most preferred amino acid residues 
of a sequence motif represent optimal cyclic substrates for the binding compound. 
Cyclic peptides, or modified forms thereof, designed based upon the interaction site 
5 motifs provided by the invention can be used to detect and quantitate binding 

compounds. Furthermore, due to their high affinity for the binding compound, such 
cyclic peptides can be used to modulate the activity of the binding compound. 
Pseudosubstrates and analogs can also be designed baised upon the amino acid sequence 
motifs provided by the invention. In a preferred embodiment, the amino acid sequence 
10 motif for the optimal interaction site is used to design pharmaceutical agents for 
modulating the biological activity of the binding compound. 

So that the invention may be more readily understood, certain terms are first 
defined. 

The term "cyclic peptide" is intended to include peptides composed of natural 

1 5 and/or unnatural a-amino acids that have been cyclized by formation of an intrapeptide 
bond. This intrapeptide bond preferably is fomned between the amino-terminal amino 
group of the peptide and the carboxy-terminal carboxy group of the peptide (referred to 
as "head to tail" cyclization), although the term "cyclic peptide" is also intended to 
include peptides that have been cyclized by intrapeptide bond formation involving a side 

20 chain group(s). The term "unnatural a-amino acids" is intended to include analogs, 
derivatives and congeners of naturally occurring amino acids. For example, unnatural 
amino acids can have lengthened or shortened side chains or variant side chains with 
appropriate functional groups. Also included are the D and L stereoisomers of an amino 
acid when the structure of the amino acid admits of stereoisomeric forms. 

25 The term "oriented degenerate cyclic peptide library (ODCPL)" is intended to 

include populations of cyclic peptides (composed of natural and/or uimatural a-amino 
acids) in which different amino acid residues are present at the same position in different 
peptides within the library. For example, a population of peptides of 10 amino acids in 
length in which the amino acid residue at position 5 of the peptides can be any one of the 

30 twenty natural amino acids would be a degenerate peptide library. A position within the 
peptides which is occupied by different amino acids in different peptides is referred to 
herein as a "degenerate position", whereas a position within the peptides which is 
occupied by the same amino acid in different peptides is referred to herein as a "non- 
degenerate position". The "oriented degenerate cyclic peptide library" used in the 

35 method of the invention is composed of cyclic peptides which have at least one amino 
acid residue at a fixed, non-degenerate position. The term "fixed position" indicates that 
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the peptides contained within the library all have the same non-degenerate amino acid 
residue at that particular position within the peptides. 

The term "binding compound" is intended to include all compounds which can 
interact with an interaction site on a cyclic peptide by one or more mechanisms. 
5 Examples of suitable binding compounds include kinases (e.g., protein serine/threonine 
kinases, protein tyrosine kinases, lipid kinases), phosphatases (e.g., protein 
phosphatases, lipid phosphatases), proteases (e.g., serine proteases, cysteine proteases), 
SH2 domains, SH3 domains, antibodies, WW domains, PTB domains, PDZ domains, 
LIM domains, pleckstrin homology domains, zinc finger domains, extracellular growth 
10 factors and receptors, adhesion molecules, intercellular signaling molecules, 7- 
transmembrane receptor proteins, ion channels, methyltransferases, ubiquitinating 
enzymes and peptidyl-transferases. 

The term "interaction" is intended to include those attractive forces which 
physically combine a binding compound with an interaction site on an oriented 
15 degenerate cyclic peptide. Such attractive forces include hydrophobic interactions, 
hydrophilic interactions, covalent binding, ionic binding, charged interactions, etc. 

The phrase "an amino acid sequence motif for an interaction site" is intended to 
describe a composite amino acid sequence which represent a consensus sequence for an 
interaction site recognized by a binding compound. An amino acid sequence motif for 
20 an interaction site typically encompasses the region including and surrounding an amino 
acid residue(s) which specifically and preferentially interacts with a binding compound. 

Various aspects of the invention are described in further detail below: 

I. Methods for Identifying Interaction Site Motifs for Binding Compounds 
2S The invention provides a method for determining an amino acid sequence motif 

for an interaction site of a binding compound. The method of the invention involves: 

contacting an oriented degenerate cyclic peptide library (ODCPL) with a binding 
compound under conditions which allow for interaction between the binding compound 
and the ODCPL, wherein the ODCPL comprises library members having an identifiable 
30 amino acid residue at a fixed non-degenerate position; 

allowing the binding compound to interact with the ODCPL such that a complex 
is formed between the binding compound and a subpopulation of library members 
capable of interacting with the binding compound; 

separating the subpopulation of library members capable of interacting with the 
35 binding compound firom library members that are incapable of interacting with the 
bindmg compound; 
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-8- 

linearizing the subpopulation of library members capable of interacting with the 
binding compound to form a subpopulation of linearized library members; 

determining the amino acid sequence of the subpopulation of linearized library 
members; and 

5 determining an amino acid sequence motif for an interaction site of the binding 

compound, based upon relative abundance of different amino acid residues at each 
degenerate position within the linearized library members. 

The specific conditions under which the binding compound and ODCPL are 
contacted will vary depending upon the particular binding compound and ODCPL used 

1 0 but are chosen such that a complex can form between the binding compound and a 
subpopulation of library members that are capable of interacting with the binding 
compound. When the binding compound is an enzyme, the enzyme and the ODCPL are 
contacted under conditions that maintain the enzymatic activity of the enzyme (e.g., a 
kinase is incubated with the ODCPL under conditions that allow for phosphorylation of 

15 the library members by the kinase). 

After complexes have formed between the binding compound and the 
subpopulation of library members that are capable of interacting with the binding 
compound (referred to as the "binding subpopulation"), the binding subpopulation is 
separated from the non-binding subpopulation (e.g., those library members that do not 

20 interact with the binding compound) and the binding subpopulation is linearized. For 
most binding compounds, the separation step will precede the linearization step. 
However, in certain embodiments, the linearization step may be performed before the 
separation step. 

The method for separating the binding subpopulation of library members from 
25 the non-binding subpopulation of library members will depend upon the particular 
binding compound used. For example, in one embodiment, the binding compound is 
inrniobilized on a solid support (e.g., a column) and the binding subpopulation of library 
members remains bound to the immobilized binding compound while the non-binding 
subpopulation is washed away. Standard methods for affmity chromatography can be 
30 used to afford such separation. Binding compounds can be immobilized to a solid 
support using methods known in the art. For example, the binding compoimd can be 
prepared as a glutathione-S-transferase (GST) frision protein and immobilized by 
binding the GST fusion protein to glutathione agarose beads. Such an approach is 
suitable for many types of binding compoimds but is particularly preferred for binding 
35 domains that mediate protein-protein interactions (such as SH2 and SH3 domains) but 
that do not have enzymatic activity. 
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In another embodiment, in which the binding compound has an enzymatic 
activity, separation of the binding subpopulation of library from the non-binding 
subpopuiation can be based on enzymatic modification of the binding subpopulation by 
the binding compound. For example, when the binding compound is a kinase, the 
5 binding subpopulation of library members becomes phosphorylated while the non- 
binding subpopulation remains nonphosphorylated (discussed further below). 
Accordingly, phosphorylated peptides can be separated from nonphosphorylated 
peptides to achieve separation of the binding subpopulation from the non-binding 
subpopulation. Similarly, when the binding compound is a phosphatase, the binding 

1 0 subpopulation of library members becomes dephosphorylated while the non-binding 
subpopulation remains phosphorylated (discussed further below). Accordingly, 
nonphosphorylated peptides can be separated from phosphorylated peptides to achieve 
separation of the binding subpopulation from the non-binding subpopulation. 

The method of linearization of the binding subpopulation also will depend on the 

1 S particular binding compound and ODCPL used in the method. In a preferred 

embodiment, the ODCPL comprises a fixed dipeptide having the ammo acid sequence 
methionine-alanine. Specific cleavage of the library members at this dipeptide can be 
achieved using cyanogen bromide. In another preferred embodiment, the binding 
compound used in the method is a protease and the library members are linearized by 

20 cleavage mediated by the protease (discussed further below). 

The linearized library members are sequenced by standard amino acid 
sequencing techniques (e.g., Edman degradation). Automated peptide sequencers can be 
used to determine the amino acid sequence of the library members. Preferably, the 
ODCPL used is a soluble synthetic peptide library and the linearized subpopulation of 

25 peptides is sequenced as a bulk population using an automated peptide sequencer. This 
approach provides information on the abundance of each amino acid residue at a given 
cycle in the sequence of the complexed mixture, most importantly at the degenerate 
positions. For each degenerate position in the selected linearized peptides (e.g., the 
binding subpopulation), a relative abundance value can then be calculated by dividing 

30 the abimdance of a particular amino acid residue at that position after library screening 
(e.g., after peptide complexation and separation) by the abundance of the same amino 
acid residue at that position in the starting library. Thus, the relative abundance (RA) of 
an amino acid residue Xaa at a degenerate position in the cyclic peptide library can be 
defined as: 



R A = amount of Xaa in the population of selected linearized peptides 

amount of Xaa in the original oriented degenerate cyclic peptide library 



wo 98/54577 



PCT/US98/10876 



-10- 



The relative abundance value may be corrected for background contamination as 
described further below. Amino acid residues v^hich are neither enriched for nor 
selected against in the population of cyclic peptides which can serve as substrates for the 
5 binding compound will have a relative abundance of 1 .0. Those amino acid residues 
which are preferred at a particular degenerate position (e.g., residues which are enriched 
at that position in the complexed peptides) will have a relative abundance greater than 
1.0. Those amino acid residues which are not preferred (e.g., residues which are selected 
against at that position in the complexed peptides) will have a relative abundance less 

10 than 1.0. Based upon the relative abundance values for each amino acid residue at a 
degenerate position, preferred amino acid residues, e.g., amino acid residues with a 
relative abundance greater than 1 .0, can be identified at that position. 

Based upon the relative abundance of different amino acid residues at each 
degenerate position within the population of phosphorylated peptides, an amino acid 

1 5 sequence motif for a phosphorylation site of the protein kinase can be determined. The 
amino acid sequence motif encompasses the degenerate region of the peptides. The 
particular amino acid residues chosen for the motif at each degenerate position are those 
which are most abundant at each position. Thus, an amino acid residue(s) with a relative 
abundance value greater than 1 .0 at a particular position can be chosen as the amino acid 

20 residue(s) at that position within the amino acid sequence motif Alternatively, a higher 
relative abundance value can be used as the basis for inclusion of an amino acid residue 
to create an even more preferred phosphorylation site for a kinase. For example, an 
amino acid residue(s) with a relative abundance value equal to or greater than 1 .5 at a 
particular position can be chosen as the amino acid residue(s) at that position within the 

25 amino acid sequence motif. 

In a preferred embodiment, the ODCPL used in the method is a soluble synthetic 
library comprising cyclic peptides comprising a formula: 




30 
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wherein Zga is a non-degenerate natural or unnatural a-amino acid, X^^ is any 
natural or unnatural a-amino acid, R-R' is a dipeptide specifically cleavabie under 
cleavage conditions to allow for linearization of the peptide, and n and m are each 
independently selected from 0-10 inclusive, with the proviso that if n is 0, m is selected 
5 from 1-10 inclusive and if m is 0, n is selected from 1-10 inclusive. In a preferred 
embodiment, R-R* is methionine-alanine, which allows for linearization of the cyclic 
peptides by cleavage of any Met-X bond with cyanogen bromide. 

In the following subsections, application of the method of the invention to 
nonlimiting examples of different types of binding compoimds is described further. 

10 

A. Kinases as Binding Compounds 

In one embodiment, a binding compound used in the method is a kinase. 
Examples of kinases include protein serine kinases, protein threonine kinases, protein 

1 S tyrosine kinases and lipid kinases. When the binding compound is a kinase, the ODCPL 
is constructed such that is contains a phosphorylatable amino acid residue at a fixed, 
non-degenerate positions. As used herein, a "phosphorylatable" amino acid residue is an 
amino acid residue that is capable of being phosphorylate by a kinase. In a preferred 
embodiment, an ODCPL for use v^th serine, threonine or tyrosine kinases comprises 

20 cyclic peptides comprising a formula: 




wherein Z^^ is a non-degenerate phosphorylatable amino acid selected from the 
25 group consisting of serine, threonine and tyrosine, X^a is any natural or unnatural a- 
amino acid, R-R' is a dipeptide specifically cleavabie under cleavage conditions to allow 
for linearization of the peptide, and n and m are each independently selected from 0-10 
inclusive, with the proviso that if n is 0, m is selected from 1-10 inclusive and if m is 0, 
n is selected from 1-10 inclusive. 
30 For protein-serine/threonine specific kinases, the ODCPL preferably comprises 

cyclic peptides comprising a formula: 



wo 98/54577 



PCT/US98/10876 



- 12- 




wherein Z^a is a non-degenerate phosphorylatable amino acid selected from the 
5 group consisting of serine and threonine, Xaa is any natural or unnatural a-amino acid, 
R-R' is a dipeptide specifically cleavable under cleavage conditions to allow for 
linearization of the peptide, and n and m are each independently selected from 0-10 
inclusive, with the proviso that if nis 0, m is selected from 1-10 inclusive and if m is 0, 
n is selected from 1-10 inclusive. 
10 For protein-tyrosine specific kinases, the ODCPL preferably comprises cyclic 

peptides comprising a formula: 




15 wherein Z^a is tyrosine, X^a is any natural or unnatural a-amino acid, R-R' is a 

dipeptide specifically cleavable under cleavage conditions to allow for linearization of 
the peptide, and n and m are each independently selected from 0-10 inclusive, with the 
proviso that if n is 0, m is selected from 1-10 inclusive and if m is 0, n is selected from 
1-10 inclusive. 

20 In the foregoing ODCPLs, preferably Zaa is the only phosphorylatable amino 

acid within the cyclic peptides. However, it is possible for the ODCPL to contain 
additional phosphorylatable residues to the fixed residue and to estimate the degree of 
background which will occur when using such a library and take this background into 
consideration when evaluating the results of the library screening (discussed fiirther 

25 below). 
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The ODCPL is contacted with a kinase under conditions which allow for 
phosphorylation of a substrate by the kinase. The kinase is allowed to phosphorylate 
cyclic peptides within the ODCPL having a preferred sequence for phosphorylation by 
the kinase, thereby forming a population of phosphorylated cyclic peptides. The phrase 
5 "contacted under conditions which allow for phosphorylation of a substrate by the 
kinase" is intended to include any form of combining or incubating together of the 
kinase and the library under conditions which enable phosphorylation of substrate 
proteins or peptides by the kinase. Thus, these conditions will include the presence of 
ATP (or an ATP analogue) as a phosphate donor molecule (or analogue thereof). The 
10 phosphate of the phosphate donor molecule can be labeled, e.g., radiolabeled, to label 
peptides which become phosphorylated by the kinase (e.g., to allow for their detection 
by detecting the radiolabel). For example, ^^p.y-ATP can be used as the phosphate 
donor. 

Following phosphorylation of phosphorylatable cyclic peptides within the 
1 5 ODCPL, the phosphorylated peptides are separated from the remaining non- 
phosphorylated cyclic peptides of the library. A preferred method for separating 
phosphorylated peptides from non-phosphorylated peptides is by binding the 
phosphorylated peptides to a ferric column. This type of colunm has been used 
previously to separate tryptic phosphopeptide fragments of phosphorylated proteins from 
20 non-phosphorylated tryptic fragments (Muszynska et al., (1986) Biochem. 25: 6850- 
6853; Muszynska et ah, (1992) J. Chromatography 604:19-28). However, previously 
described protocols could not be used because the phosphopeptide were eluted from the 
column in such a way that they could not then be sequenced, which is the necessary 
subsequent step in the procedure (e.g., the washing and elution buffers were 
25 incompatible with subsequent sequencing of the phosphopeptides). The column 

washing and elution conditions were modified to allow for subsequent sequencing of the 
eluted phosphopeptides. In the modified procedure, the peptide mixture is loaded onto 
the column in high salt buffer of about pH 5.5-6.0 (e.g., 50 mM MES, 1 M NaCl, pH 
5.5). In this buffer, phosphorylated peptides bind to the colunm whereas non- 
30 phosphorylated peptides flow through the column. Next, the column is washed with a 
very low salt buffer of about pH 6.0 (e.g., 2 mM MES, pH 6.0) to remove contaminating 
non-phosphoiylated peptides and excess salt from the column. Finally, the 
phosphopeptides are eluted from the column with a buffer of about pH 8.0 (e.g., 500 mM 
NH4CO3, pH 8.0). Procedures for quantitative separation of phosphorylated peptides 
35 from non-phosphorylated peptides are described fiirther in U.S. Patent No, 5,532,1 67. 

Alternatively, since many kinases are known to be capable of using ATP-y-S as a 
thio-phosphate donor to phosphorylate proteins and peptides, a mercury (Hg2+) column 
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(pChloromercuribenzoate-Agarose: Pierce) could be used for separating 
thio-phosphorylated peptides from non-phosphorylated peptides. Such columns have 
been used in the past to bind thio-phosphoryiated nucleotides and peptides (Sun, I.Y-C, 
and AUfrey, V.G. (1982) J. Biol. Chem. 257:1347-1353 and Sun et al., (1980) J. Biol. 
5 Chem. 255:742-747). Since any thiol group will bind to the mercury colunm, Cys 

cannot be present in the library if the mercury colunm is used. Also, the kinase reaction 
is typically about 5 fold slower with ATP-y-S so more enzyme or a longer incubation 
time is likely to be necessary. 

The use of a mercury colunm to separate thio-phosphorylated peptides from non- 

10 thio-phosphorylated peptides can allow for the inclusion of phosphotyrosine, 

phosphoserine or phosphothreonine at degenerate positions in the library. In this case, 
many peptides within the library would be phosphorylated during the synthesis of the 
library and the kinase reaction would then be performed with ATP-y-S as a thio- 
phosphate donor to thiophosphorylate substrates of the kinase. The thiophosphorylated 

1 5 peptides would then be separated from the non-thio-phosphorylated peptides with a 
mercury column. 

Another theoretical alternative approach to separating phosphorylated peptides 
from non-phosphorylated peptides is to use an antibody-affinity column. For example, 
an anti-phosphotyrosine antibody-column can be used to separate peptides 

20 phosphorylated on tyrosine from non-phosphorylated peptides. However, because some 
anti-phosphotyrosine antibodies have some specificity for the amino acids surrounding 
the phosphorylated tyrosine, the antibody column could impose additional selection on 
the mixture of phosphorylated peptides. Thus, it is likely that some peptides which can 
be substrates for the kinase (e.g., a proportion of the phosphorylated peptide mixture) 

25 might be lost during the colunm separation step, thereby artifactually altering the 
mixture of peptides which is sequenced. 

Prior to the step of separating the phosphorylated (or thio-phosphorylated) 
peptides from non-phosphorylated peptides, additional purification steps may be added 
to remove other components of the kinase reaction, such as the kinase itself and the free 

30 phosphate donor. For example, the kinase can be bound to a solid support, such as a 
bead, during the kinase reaction and then the supernatant, containing the phosphorylated 
and non-phosphorylated peptides can be removed, thereby separating the kmase from the 
peptide. Chromatography can be used to separate the free phosphate donor (e. g, ATP) 
from the peptides, for example a DEAE column can be used. 

35 Following separation of the phosphorylated cyclic peptides from 

nonphosphorylated cyclic peptides, the phosphorylated cyclic peptides are linearized and 
subjected to amino acid sequencing. An amino acid sequence motif for the 
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phosphorylation site of the kinase within a cyclic peptide is determined based upon the 
relative abundance of amino acid residues at each degenerate position of the peptide 
library. 

The method of the invention can be used to determine an amino acid sequence 
5 motif for the phosphorylation site of any kinase. For example, the kinase can be a 
protein-serine/threonine specific kinase (in which case a library with a fixed non- 
degenerate serine or threonine is used), a protein-tyrosine specific kinase (in which case 
a library with a fixed non-degenerate tyrosine is used) or a dual-specificity kinase (in 
which case a library with either a fixed non-degenerate serine, threonine or tyrosine can 

10 be used). Nonlimiting examples of protein kinases which are encompassed by the 
invention can be found in Hanks et al. (1988) Science 241:42-52. 

Protein-serine/threonine specific kinases encompassed by the invention include: 
1) cyclic nucleotide-dependent kinases, such as cyclic-AMP-dependent protein kinases 
(e.g., protein kinase A) and cyclic-GMP-dependent protein kinases; 2) calcium- 

1 5 phospholipid-dependent kinases, such as protein kinase C; 3) calcium-calmoduUn- 
dependent kinases, including CaMII, phosphorylase kinase (PhK), myosin light chain 
kinases (e.g., MLCK-K, MLCK-M), PSK-Hl and PSK-C3; 4) the SNFl family of 
protein kinases (e.g., SNF 1, niml, KINl and KIN2); 5) casein kinases (e.g., CKII); 6) 
the Raf-Mos proto-oncogene family of kinases, including Raf, A-Raf, PKS and Mos; 

20 and 7) the STE7 family of kinases (e.g. STE7 and PBS2). Additionally, the protein- 
serine/threonine specific kinase can be a kinase involved in cell cycle control. Many 
kinases involved in cell cycle control have been identified. Cell cycle control kinases 
include the cyclin dependent kinases, which are heterodimers of a cyclin and kinase 
(such as cyclin B/p33cdc2^ cyclin A/p33CDK2^ cyclin E/p33CDK2 and cyclin 

25 Dl/p33CDK4)^ other cell cycle control kinases include Weel kinase, Niml/Cdrl 
kinase and Wisl kinase. 

Protein-tyrosine specific kinases encompassed by the invention include: 1) 
members of the src family of kinases, including pp60C-src^ pp60^"Src^ Yes, Fgr, FYN, 
LYN, LCK, HCK, Dsrc64 and Dsrc28; 2) members of the Abl family of kinases, 

30 including Abl, ARG, Dash, Nabl and Fes/Fps; 3) members of the epidemial growth 

factor receptor (EGFR) family of kinases, including EGFR, v-Erb-B, NEU and DER; 4) 
members of the insulin receptor (INS.R) family of growth factors, including INS.R, 
IGFIR, DILR, Ros, less, TRK and MET; 5) members of the platelet-derived growth 
factor receptor (PDGFR) family of kinases, including PDGFR, CSFIR, Kit and RET. 

35 Other protein kinases which can be used in the method of the invention include 

syk, ZAP70, Focal Adhesion Kinase, erkl, erk2, erk3, MEK, CSK, BTK, ITK, TEC, 
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25 



TEC-2, JAK-1, JAK-2, LET23, c-fms, S6 kinases (including p70S6 and RSKs), TGF- 
p/activin receptor family kinases and Clk. 

The amino acid sequence motifs determined by the method of the invention are 
useful for predicting whether a protein is a substrate for a particular protein kinase. The 
primary amino acid sequence of a known protein can be examined for the presence of 
the determined amino acid sequence motif. If the same or a very similar motif is present 
in the protein, it can be predicted that the protein could be a substrate for that protein 
kinase. 

B. Phosphatases as Binding Compounds 

In another embodiment, a binding compound used in the method is a 
phosphatase. When the binding compound is a phosphatase, the ODCPL is constructed 
such that it contains a phosphorylated amino acid residue at a fixed, non-degenerate 
positions. This phosphorylated amino acid residue then becomes dephosphorylated by 
the phosphatase. In a preferred embodiment, an ODCPL for use with serine, threonine 
or tyrosine phosphatases comprises cyclic peptides comprising a formula: 



wherein is a non-degenerate phosphorylated amino acid selected from the 
group consisting of phosphoserine, phosphothreonine and phosphotyrosine, Xaa is any 
natural or unnatural a-amino acid, R-R' is a dipeptide specifically cleavable under 
cleavage conditions to allow for linearization of the peptide, and n and m are each 
independently selected from 0-10 inclusive, vydth the proviso that if n is 0, m is selected 
from 1-10 inclusive and if m is 0, n is selected from 1-10 inclusive. Preferably, Zaa is 
the only phosphorylated amino acid within the cyclic peptides. 

For phosphatases, the incubation, separation and lineeuization steps are 
performed in as similar manner as described above for kinases except that: the 
phosphatase and the ODCPL are incubated under conditions that allow for 
dephosphorylation of library members by the phosphatase and in the separation step, 
nonphosphorylated peptides are selected (rather than phosphorylated peptides). 
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Sequencing and detennination of the amino acid motif for the dephosphorylation site are 
performed as described above for kinases. 

C. SH2 and SH3 Domains as Binding Compounds 
5 In yet another embodiment, a binding compound used in the method comprises a 

Src Homology 2 (SH2) domain or a Src Homology 3 (SH3) domain. It is known that 
SH2 domains have specificity for phosphotyrosine-containing targets. Accordingly, 
when a binding compound comprises an SH2 domain, the ODCPL is designed to contain 
a phosphotyrosine residue at a fixed, nondegenerate position. In a preferred 
10 embodiment, an ODCPL for use with SH2 domains comprises cyclic peptides 
comprising a formula: 




1 5 wherein Z^^ is a non-degenerate phosphotyrosine residue, Xg^ is any natural or 

unnatural a-amino acid, R-R' is a dipeptide specifically cleavable under cleavage 
conditions to allow for linearization of the peptide, and n and m are each independently 
selected from 0-10 inclusive, with the proviso that if n is 0, m is selected from 1-10 
inclusive and if m is 0, n is selected from 1-10 inclusive. 

20 In a preferred embodiment, the binding compound comprising an SH2 domain is 

inmiobilized on a solid support, for example by expressing the SH2 domain as a GST 
fusion protein and immobilizing the fusion protein on glutathione-agarose. To select for 
library members that interact with the SH2 domain, the ODCPL is passed over the solid 
support to allow the binding compound to interact with a subpopulation of library 

25 members capable of interacting with the SH2 domain, and library members incapable of 
interacting with the SH2 domain are washed away to thereby separate the subpopulation 
of library members capable of interacting with the SH2 domain from library members 
incapable of interacting with the SH2 domain. Methods for preparing SH2-GST fusion 
proteins and for screening linear peptide libraries against these fusion proteins are 

30 described further in Songyang et al (1993) Cell 72:161-11%. These methods can be 
adapted for use ui the cyclic peptide screening methods of the invention. 
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It is known that SH3 domains preferentially bind to proline-containing targets. 
Accordingly, when the binding compound comprises an SH3 domain, the ODCPL 
preferably contains at least one proline residue at a fixed, nondegenerate position. In a 
preferred embodiment, an ODCPL for use with SH3 domains comprises cyclic peptides 
5 comprising a formula: 




wherein Z^^ is a non-degenerate proline, is any natural or unnatural a-amino 
10 acid, R-R' is a dipeptide specifically cleavable under cleavage conditions to allow for 
linearization of the peptide, and n and m are each independently selected from 0-10 
inclusive, with the proviso that if n is 0, m is selected from 1-10 inclusive and if m is 0, 
n is selected firom 1-10 inclusive. 

Preferably, the binding compound comprising an SH3 domain is immobilized on 
15 a solid support, as described above for SH2 domains. The ODCPL then is passed over 
the solid support to allow the binding compound to interact with a subpopulation of 
library members capable of interacting with the SH3 domain, and library members 
incapable of interacting with the SH3 domain are washed away to thereby separate the 
subpopulation of library members capable of interacting with the SH3 domain from 
20 library members incapable of interacting with the SH3 domain. 

Linearization of the selected cyclic peptides, sequencing and determination of the 
amino acid sequence motif are performed as described above for kinases, 

D. Proteases as Binding Compounds 

25 In yet another embodiment, the binding compound is a protease. If the particular 

protease is known to preferentially cleave at a certain amino acid residue(s), the ODCPL 
can be designed to incorporate the preferred amino acid residue(s) at a fixed, non- 
degenerate position. For example, members of the caspase family of cysteine proteases 
are known to preferentially cleave after an aspartic acid residue (although the substrate 

30 specificity of particular caspases is influenced by amino acid residues surrounding the 
aspartic acid residue). Accordingly, when the binding compound is a caspase, the 
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ODCPL is designed to have an aspartic acid residue at a fixed, non-degenerate position. 
Alternatively, proteases which cleave peptide bonds include TNF-alpha (Tumor 
Necrosis Factor) converting enzymes which cleave Ala-Val bonds and serine proteases, 
such as chymotrypsin which preferentially cleaves the bond after an aromatic residue 
5 and trypsin which preferentially cleaves after arginine and lysine residues. 

Proteases which cleave specific peptide bonds include aspartyl proteases such as 
pepsin and cathepsin D. Proteases which belong to the metalloproteinase family include 
collagenase, stromeylsin, angiotensin-converting enzyme, fertilin-a and p, and the MMP 
(matrix-metalloproteinase) class. Suitable cysteine proteases include papain, cathepsin 
10 Bl, B2, H, L and F. The aminopeptidase family includes aminopeptidases A, N, GP160 
and leucine aminopeptidase. Additionally, the carboxypeptidase family includes 
carboxypeptidases H, M and E. 

When the binding compound is a protease, the subpopulation of library members 
that is capable of interacting with the protease is linearized by allowing the protease to 
15 cleave these library members (e.g., the linearization step is achieved using the protease 
itself). Thus, within the library, those library members that become linearized represent 
library members that are capable of interacting with the protease. 

Example 3 fiirther describes application of the method of the invention to 
proteases. 

20 

E. Antibodies as Binding Compounds 

In yet another embodiment, the binding compound is an antibody, or antigen- 
binding fragment thereof. Examples of antigen-binding fragments include Fab, Fd, Fv, 
F(ab*)2 and scFv fragments. The screening method of the invention can be used to map 
25 the epitope(s) of a monoclonal antibody, described further in Example 6. Moreover, the 
invention provides methods for generating monoclonal antibodies (e.g., for diagnostic or 
therapeutic use) using cyclic peptide libraries as immunogens, described further in 
Example 7. 



30 F. Other Binding Compounds 

The screening methods of the invention are applicable to a wide variety of 
additional binding compounds. Non-limiting examples include WW domains, PTE 
domains, PDZ domains, LIM domains, pleckstrin homology domains, zinc finger 
domains, extracellular growth factors, growth factor receptors, adhesion molecules, 

35 intercellular signaling molecules, lipid phosphatases, 7-transmembrane receptor proteins, 
proteases, ion channels, methyltransferases, ubiquitinating enzymes and peptidyl- 
transferases. 
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When information is already known about the binding preferences of a particular 
binding compound, this information can be utilized in the design of the ODCPL. For 
example, for WW domains, which are known to preferentially bind proline-rich regions, 
the ODCPL can be designed to incorporate at least one proline residue at a fixed, non- 

5 degenerate positions. For PTB domains, which have been reported to preferentially bind 
an NPXpY motif (wherein, N is asparagine, P is proline, X is any amino acid and pY is 
phosphotyrosine), this motif can be incorporated into the ODCPL. For PDX domains, 
which have been reported to preferentially bind an (S/T)XV motif (wherein S is serine, T 
is threonine, X is any amino acid and V is valine), this motif can be incorporated into the 

10 ODCPL. 

G. Other Embodiments 

Another embodiments of the screening method of the invention include use of 
the cyclic peptide libraries in combination with linear peptide libraries. That is, the same 
1 5 binding compound can be screened against both a cyclic peptide library and a linear 
peptide library. Use of cyclic peptide libraries in combination with linear peptide 
libraries for epitope mapping of monoclonal antibodies is described further in Example 
6. 

In yet another embodiment of the screening methods of the invention, the 
20 binding compound is encapsulated in a liposome prior to contacting the binding 

compound with the ODCPL (described further in Example 5), By encapsulating the 
bindmg compound within a liposome prior to peptide library screening, only those cyclic 
peptides which are permeable to lipid bilayers (and therefore good drug candidates and 
candidate lead compounds) can enter the liposome and interact with the binding 
25 compound. Thus, by combining the cyclic library screening with the lipid encapsulation 
of the binding compound, the search for cyclic-peptide-based drug candidates is 
converted into a one-step screen that incorporates both optimal substrate selection and 
membrane permeability. 

Still another embodiment of the screening method provides for duplex screening 
30 of both linear and cyclic libraries to identify optimal chain length or ring size for a 

particular binding compound. This embodiment utilizes the "tag termination technique" 
(TTT), which is described in further detail in Example 8. 

IL Oriented Degenerate Cyclic Peptide Libraries 
. 35 Another aspect of the invention pertains to oriented degenerate cyclic peptide 

libraries. In a preferred embodiment, the invention provides and ODCPL which 
comprises cyclic peptides comprising a formula: 
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wherein Z^a is a non-degenerate natural or unnatural a-amino acid, Xaa is any 
5 natural or unnatural a-amino acid, R-R' is a dipeptide specifically cleavable under 
cleavage conditions to allow for linearization of the peptide, and n and m are each 
independently selected from 0-10 inclusive, with the proviso that if n is 0, m is selected 
from 1-10 inclusive and if m is 0, n is selected from 1-10 inclusive. In other 
embodiment, m and n, independently, are selected from 1-7 inclusive. In yet another 

10 embodiment, m and n, independently, are selected from 1-5 inclusive. In a preferred 
embodiment, R-R* is methionine-alanine. In various embodiments, Zaa can be, for 
example, a non-degenerate phosphorylatable amino acid (such as serine, threonine or 
tyrosine), non-degenerate phosphorylated amino acid (such as phosphoserine, 
phosphothreonine or phosphotyrosine), a proline, a nonnatural a-amino acid, a 

1 5 hydrophobic natural or nonnatural a-amino acid or a hydrophilic natural or nonnatural 
a-amino acid. 

Preferably the cyclic peptides of the ODCPL are cyclized in a "head-to-tail" 
manner by formation of a peptide bond between the amino and carboxy terminii. 
Methods for solid-phase synthesis of "head-to-tail" cyclic peptides have been described 

20 (see e.g., McMunay, J.S. (1991) Tetrahedron Letter 32:7679-7682; Trzeciak, A. and 
Bannwarlh, W. (1992) Tetrahedron Letters 33:4557-4560; Tromelin, A. et al. (1992) 
Tetrahedron Letters 33:5197-5200; PCT Publication WO 95/34577). Methods for 
synthesizing cyclic peptides are described further in Example 1 . 

The cyclic peptides libraries of the invention contain at least one fixed, 

25 nondegenerate amino acid positions and several degenerate amino acid positions. In a 
preferred embodiment, the amino acid residues on either side of the fixed non- 
degenerate amino acid residue are degenerate (e.g., immediately N-temiinal and C- 
terminal to the non-degenerate residue), thus enabling one to determine an interaction 
site motif for the region surrounding the fixed amino acid residue. For example, four 

30 amino acid residues located on each side of the non-degenerate amino acid residue can 
be degenerate (e.g., positions -4, -3, -2, -1, +1 , +2, +3 and +4, relative to the non- 
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degenerate amino acid residue at position 0, can be degenerate). The degenerate 
positions in the peptides of an oriented degenerate cyclic peptide library can be created 
such that any one of the twenty natural amino acids, as well as unnatural a-amino acids, 
can occupy those positions. However, in order to reduce "background" events (e.g., 
5 enzymatic events at a residue other than a fixed residue), it is preferred that the 
degenerate positions not contain amino acid residues that can be acted upon by the 
particular binding compoimd being examined. Thus, for example, when the binding 
compound is a protein-serine/threonine kinases and the fixed residue is a serine or 
threonine, it is preferred that the degenerate positions not contain serine or threonine, 

10 Likewise, for a protein-tyrosine specific kinase, where the fixed residue is a tyrosine, it 
is preferred that the degenerate positions not contain tyrosine. 

However, if desired, an oriented degenerate cyclic peptide library can contain 
residues in addition to the fixed residue that can be acted upon by the binding 
compound, since it is possible to estimate the degree of backgroimd which will occur 

1 3 when using such a library and take this background into consideration when evaluating 
the results of the library screening. For example, consider the problem of including Tyr 
at the degenerate positions of a library having 8 degenerate positions that is to be used 
with a protein-Tyr kinase. The kinase may phosphorylate the tyrosine residues at the 
degenerate positions as well as the Tyr at the fixed position. The theoretical fi-action of 

20 peptides with Tyr at one of the degenerate sites in addition to the fixed site (assuming a 
degeneracy of 20 at each residue) is 8 x (l/20)=0.4. Thus, about 40% of the 
phosphopeptides purified are likely to be phosphorylated at the wrong residue. It is 
possible to get a good estimate of the extent of this problem since those peptides that are 
phosphorylated at sites other than the fixed residue will have detectable 

25 non-phosphorylated Tyr at the sequencing cycle corresponding to the fixed residue when 
the sequencing is performed on the mixture. Although these phosphopeptides will cause 
a background problem when the data are plotted, the problem is less serious than 
initially implicated ftom the 40% contamination assumed. For example, consider the 
case in which the protein-tyrosine kinase being evaluated phosphorylates the sequence 

30 Glu-Xxx-Tyr and the fixed Tyr is at position 7 in the peptides. Those peptides with Glu 
at position S will be preferentially phosphorylated. However, peptides with Glu at 
position 3 and Tyr at position S will also be phosphorylated and appear in the mixture. 
Similarly, those with Glu at 4 and Tyr at 6, Glu at 6 and Tyr at 8, Glu at 8 and Tyr at 10 
and Glu at 9 and Tyr at 1 1 will also be selected. Each of these subfamilies of peptides is 

35 far less abundant than the group with Tyr fixed at position 7 (in theory, l/20th). Thus, 
the expected result is that Glu will be very abundant at cycle 5 but will also be somewhat 
elevated (l/20th as much as in cycle 5) at cycles 3, 4, 6, 8, and 9. In general this 
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background is unlikely to be a problem. The importance of re-evaluating protein kinases 
with phosphorylatable residues at the degenerate positions is that Ser, Thr or Tyr 
residues upstream or downstream of the phosphorylated residue are already known to be 
important for some protein kinases. 
5 Additionally, certain amino acid residues may be omitted from a degenerate 

position in an oriented degenerate cyclic peptide library for practical reasons. For 
example, tryptophan and cysteine residues may be omitted from the degenerate positions 
because there can be problems with detecting these residues during amino acid 
sequencing and cysteine residues may cause peptide dimer formation. However, a 

10 peptide library containing Trp and Cys at a degenerate position can be used in the 

method of the invention. For example, a "second generation" library can be made based 
upon the amino acid sequence motif determined from initially screening a protein kinase 
with a peptide library which does not contain Trp or Cys at degenerate positions. At 
each degenerate position of this library, the peptides would have either the preferred 

15 amino acid residue determined from the initial screening or Cys or Trp. Thus, in this 
library, rather than bemg 1 of 20 possible natural residues at the degenerate position, Cys 
and Trp would be 1 of 3 possible residues at the degenerate position. Therefore, Trp and 
Cys will have a much stronger signal during amino acid sequencing of the peptides, 
thereby allowing for detection of these residues and for evaluation of their influence on 

20 the amino acid sequence motif of the phosphorylation site. 

A preferred type of oriented degenerate cyclic peptide library for use in the 
method of the invention is a soluble synthetic peptide library. The term "soluble 
synthetic peptide library" is intended to mean a population of peptides which are 
constructed by in vitro chemical synthesis, for example using an automated peptide 

25 synthesizer, and which are not connected to a solid support such as a bead or a cell. For 
general descriptions of the construction of soluble synthetic peptide libraries see for 
example Houghten, R.A., et al., (1991) Nature 354:84-86 and Houghten, R.A., et al., 
(1992) BioTechniques 13:412-421 . Standard techniques for in vitro chemical synthesis 
of peptides are known in the art. For example, peptides can be synthesized by 

30 (benzotriazolyloxy)tris(dimethylamino)-phosphonium hexafluorophosophate (B0P)/1 - 
hydroxybenzotriazole coupling protocols. Automated peptide synthesizers are 
commercially available (e.g., Milligen/Biosearch 9600). To create degenerate positions 
within peptides of a soluble synthetic peptide library, two approaches can be used. A 
preferred approach is to divide the resin upon which the peptides are synthesized into 

35 equivalent portions and then couple each aliquot to a different amino acid residue to 
create a degenerate position. After this coupling, the resin aliquots are recombined and 
the procedure is repeated for each degenerate position. This approach results in 
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approximately equivalent representation of each different amino acid residue at the 
degenerate position. Alternatively, a mixture of different amino acid residues can be 
added to a coupling step to create a degenerate position. (Ragnarsson et al. (1971) Acta. 
Chem. Scand. 25: 1487-1489). However, different amino acid residues have different 
5 coupling efficiencies and therefore if equal amounts of each amino acid are used, each 
amino acid residue may not be equivalently represented at the degenerate position. The 
different coupling efficiencies of different amino acids can be compensated for by using 
a "weighted" mixture of amino acids at a coupling step, wherein amino acids with lower 
coupling efficiencies are present in greater abundance than amino acids with higher 

1 0 coupling efficiencies. 

An alternative soluble, oriented degenerate cyclic peptide library involves 
constructing a linear peptide library attached to a soUd support (e.g., a cyclic library 
bound to beads, sometimes referred to as "one bead, many peptides" approach, for 
sequencing after proteolysis). The term "solid-support bound peptide library" is 

1 5 intended to mean a population of peptides which are connected to a solid support such as 
a bead or plastic pin. For general descriptions of the construction of solid-support bound 
peptide libraries see for example Geysen, H.M., et al. (1986) MoL Immunol 23:709-715; 
Lam, K.S., et al. (1991) Nature 354:82-84: and Pinilla, C, et al. (1992) BioTechniques 
13:901-905. For this type of library, the peptides are synthesized attached to the solid 

20 support, such as a bead, and degenerate positions are created by splitting the population 
of beads, coupling different amino acids to different subpopulations and recombining the 
beads. The final product is a population of beads each carrying many copies of a single 
unique peptide. Thus, this approach has been tenned "one bead/one peptide". 

With a solid support bound library, each isolate gives only the single amino acid 

25 sequence of that isolated peptide and therefore many isolated peptides must be 

individually sequenced before one can arrive at a consensus. A soluble synthetic peptide 
library is strongly preferred for use in the method of the invention because the bulk 
population of isolated peptides can be sequenced simultaneously, thus directly providing 
information on the relative abundance of different amino acid residues at each 

30 degenerate position within the population of peptides. 

An alternative that has been described in the art to the soluble, oriented 
degenerate cyclic peptide library of the present invention involves allowing a soluble, 
linear peptide library to interact with a binding compound. The single peptide, or a 
mixture of a limited number of peptides bmding to the compound, are then retrieved and 

35 analyzed (U.S. Patent No. 5,010,175 and U.S. Patent No. 5,225,533). In this method, 
however, in order to analyze the compositions of active peptides from libraries 
comprised of large numbers of peptides, the mass of the mixture increases. For example 
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to work with a library of 64,000,000 peptides a mixture of 3.5 grams of various peptides 
would be needed. The cyclic libraries of the present invention allow for the use of much 
lower amounts of the mixture while still achieving as great or even much greater 
diversity in sequences. 

5 With prior art approaches, it is not possible to predict the order of affinities of the 

various peptides isolated imless they are individually synthesized and compared in 
binding experiments. One cannot be certain that one has not missed the best possible 
motif because not enough isolated peptides were sequenced. In addition, while the 
soluble synthetic oriented degenerate cyclic peptide library provides predictions about 

10 substitutions that would severely reduce the affinity of the peptide, no such information 
can be obtained from a solid-support bound library. 

The diversity of the peptide library (e.g., the number of different peptides 
contained within the library) is a function of the number of degenerate residues: the 
greater the number of degenerate residues the greater the diversity. For example, a 

1 5 library in which only 2 positions are degenerate and any of the twenty natural amino acid 
can be at these degenerate positions would represent 400 unique peptides (20^) whereas 
a library in which 8 positions are degenerate and any amino acid can be at these 
positions would represent approximately 2.5 x 10^^ unique peptides (20^). An oriented 
degenerate peptide library can be prepared in which the degenerate residues encompass 

20 the region likely to interact with the binding compound. The number of amino acids 
residues that influence the substrate specificity of a binding compound will differ for 
different binding compounds. Thus, it is expected that oriented degenerate cyclic 
peptide libraries which will be useful in the method of the invention may have as few as 
one degenerate amino acid residue to as many as ten degenerate amino acid residues on 

25 either side of the fixed residue. Alternative to having an equal number of degenerate 
residues on either side of the fixed residue, one can use a library which has unequal 
numbers of degenerate residues on either side of the fixed residue (e.g., 2 on one side 
and 4 on the other, etc.). The optimal length (and corresponding optimal ring size) for 
an ODCPL for a particular binding compound can be determined using methods 

30 described further in Example 8. 

Inherent in the design of the ODCPL is the assumption that no single molecule 
(peptide or other type of molecule) in the library need be present in sufficient quantities 
to be detected or analyzed. This is because common features of a group of thousands to 
billions of molecules that have a common ability to bind to a target or to be processed by 

35 an enzyme can be determined. In addition, comparison of the properties of this selected 
group of molecules to the starting mixture, the ODCPL, does not require individual 
molecules be present in similar amounts. 
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For example, one can begin with an ODCPL (8 units) in which 1% of the 
polymers have Valine at position number one, 10% have Glycine at position number 
one, 5% have Glutamate at position number one and the remaining 1 7 amino acids are 
present at variable (but known) abundance at this position. Similarly, the abundance of 
5 the 20 amino acids at position number 2 will be variable but known (Edman sequencing 
of the mixture provides this information). The total degeneracy of the library is 20 to the 
8th power or 25.6 X 109 

Typically, about one mg (about a micromole) of an ODCPL is used for analysis. 
With a degeneracy of 20^ in the starting library and a micromole of material, the typical 
10 abundance of a peptide in the starting mixture is about 1 micromole/25.6 x 10^ =3.9 x 
10-*'^ moles or about 0.039 fmol. Even if a given peptide is 100 fold more abundant than 
the average abundance, it would still be present at a level that is about 1000 fold below 
the detection limit of any currendy known analytical technique. 

About one mg of the ODCPL (about a micromole) is passed over a column that 
1 5 contains the target protein and typically about 0. 1 % of this mixture binds preferentially 
to the target of interest (e.g., the catalytic site of a protease). Thus, about 1 microgram 
(1 nanomole) of total peptides is retained. This nanomole of material is a mixture of 
millions of different peptides, each one of which is at concentrations far below any 
detection technique. 

20 Although this is an extremely heterogenous mixture of materials, with no single 

molecule at sufficient quantities for detection, Edman degradation of the complete pool 
of selected peptides allows quantitative removal of the amino-terminal residue from 
every peptide in the mixture and reveals the relative quantity of each amino acid at this 
position. Thus it may be found, for examaple, that 8% of the selected peptides have 

25 Valine at the amino-terminus, 8% have Glycine at the amino terminus and that the 

remaining amino acids are present at about the same abundance that they were present in 
the starting library. It can be concluded that this target protein preferentially binds 
peptides with Valine at the amino-terminus because the ratio of Valine at this position in 
the selected peptides compared to the starting mixture 8%/l% = 8. In contrast, even 

30 though Glycine was present at the same abundance as Valine (8%), it can be concluded 
that Gly is selected against, since it is relatively less abundant in the selected peptides 
than in the starting library: 8%/10% = 0.8. A second round of Edman degradation would 
reveal the preference at the 2nd position from the amino-terminus, etc. Thus, the present 
procedure does not require that any molecule (peptide) be present at a detectable 

35 (analyzable) amount in the library and does not require equal (or even similar) 
representation of individual peptides in the library. 
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Another advantage of the invention is that it even allows one to deduce an 
optimal peptide from a library that lacks that peptide, as illustrated by the following 
example in which the optimal peptide for binding to a target protein is Isoleucine- 
Isoleucine-Isoleucine-Isoleucine-Isoleucine-Isoleucine. Because of the poor coupling of 
5 Isoleucine to itself, there may not be a single copy of a peptide with six Isoleucines in a 
row in the starting library that is constructed. Yet, the collection of millions of peptides 
that likely would be purified would include a preponderance of peptides with sequences 
such as Isoleucine-X-Isoleucine-X-Isoleucine-X, Isoleucine-X-Isoleucine-X-X- 
Isoleucine, X-Isoleucine-X-Isoleucine-X-Isoleucine, etc. (where X is any of the 

10 remaining amino acids). After sequencing this mixture, it would be found that 

Isoleucine is relatively more abundant than any other amino acid at all 6 positions and 
thus one would propose that Isoleucine-Isoleucine-Isoleucine-Isoleucine-Isoleucine- 
Isoleucine is most likely to be the highest affinity peptide. Hence, one can deduce the 
highest affinity polymer even if that polymer is not in the library. 

1 5 In view of the foregoing, another aspect of the invention is the use of an ODCPL 

of very high degeneracy, e.g., at least 1x10^ unique peptides, more preferably at least 
1x10^ or at least 1x10^^ unique peptides, and even more preferably at least about 
2.5xl0l0(208) unique peptides. Moreover, the invention allows for the use of ODCPL 
in which each unique peptide is present in the starting mixture at only very low amounts, 

20 e.g., less than about 1 fmol, less than about 0,5 fmol, less than about 0.05 fmol, and even 
as low as about 0.039 fmol (using 1 micromole of a library containing 20^ unique 
sequences). 

Furthermore, the invention allows for the selection of peptides of interest from 
the ODCPL, wherein millions of peptides may be present in the selected mixture, but at 
25 only very low amounts. For example, the invention allows for the selection of peptide 
mixtures that may contain at least 1x10^, at least 1x10^ or even at least 5x1 0^ different 
peptides. This selected peptide mixture may represent less than about 100 nanomoles, 
less than about 10 nanomoles or even less than about 1 nanomole of total peptides. 

30 III. Methods for Purifying Cvclic Peptides 

Yet another aspect of the invention pertains to methods for purifying cyclic 
peptides from linear peptides. The method of the invention involves: 
providing a mixture of cyclic peptides and linear peptides; 
contacting the mixture v^th a blocking agent that reacts with the free amino 
35 termini of the linear peptides to form amino-protected linear peptides; 

contacting the mixture with a binding agent that is capable of interacting with the 
amino-protected linear peptides but incapable of interacting with the cyclic peptides; and 
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separating the amino-protected linear peptides from the cyclic peptides to thereby 
purify the cyclic peptides. 

In a preferred embodiment, the blocking agent is a biotin, which biotinylates the 
free amino-termini of the linear peptides, and the binding agent is a biotin-binding agent, 
5 such as avidin or streptavidin. For example, biotinylated linear peptides can be 

separated from cyclic peptides by passing the mixture over an avidin column. Methods 
for purifying cyclic peptides using biotin-avidin affinity chromatography are described 
further in Example 2. In an alternative embodiment, the binding agent is an antibody 
that is specific for the blocking agent and the cyclic peptides are purified using antibody 
1 0 affinity chromatography . 

This invention is further illustrated by the following examples which should not 
be construed as limiting. The contents of all references and published patents and patent 
applications cited throughout the application are hereby incorporated by reference, 

15 EXAMPLE 1 : On Resin Head-to-Tail Cyclization of Peptides 

using Asp a-AUyl Esters 

It has been shown that allyl based protecting groups can be removed under mild 
conditions by Pd(0) catalyzed allyl transfer (H. Kunz & A. Waldmann (1983) Angew, 

20 Chem. InL Ed, Eng, , 23 :7 1 ; H. Kunz & C, Unverzagt (1983) Angew. Chem. InL Ed 

Eng, 23:436; P.D. Jeffrey & S.W. McCombie (1982) J. Org Chem., 47:587; B.M. Trost 
(1980) Acc. Chem, Res,, 13:385). Following resin coupling of the carboxylic acid side 
chain of the FMOC a-allyl ester of Asp, the degenerate peptide library containing an 
orienting residue (Y of Figure 1 as shown) is synthesized using standard FMOC-amino 

25 acid chemistry. Following synthesis, the FMOC protecting group on the Met residue is 
removed with piperidine, the allyl ester removed by Barany's method and head-to-tail 
cyclization achieved by activation of the free resin-bound carboxyl of Asp by the 
coupling reagent HBTU. Un-cyclized peptides are blocked by biotinylation, followed by 
final side chain deprotection and resin cleavage. 

30 More specifically, the carboxylic acid side chain of Fmoc alpha-allyl esters of 

Asp is first coupled to the resin (NovaSyn TGR resin, 0.25 mmoi) and the degenerate 
peptide library is then synthesized according to the standard Fastmoc cycles with HBTU 
(2-(lH-Benzotriazole-l-yl)-l,l,3,3-tetramethyluronium hexafluorophosphate) coupling 
protocols on a Peptide Biosynthesizer (Minipore Model). At the degenerate positions, 

35 1 8 diflFerent Na-FMOC-blocked amino acids (excluding Trp, Cys) are added 

simultaneously in four- fold molar excess of the coupling resin. Prior to cyclization, the 
Fmoc group on Met is removed and the allyl ester subsequently deprotected by Barany's 
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method (F. Albericio et al. in "Peptides 1992", C.H. Schneider & A.N. Eberle (eds), 
ESCOM, Leiden, 1993, ppl91; S.A. Kates et al. in "Peptides; Chemistry, Structure & 
Biology", R.S. Hodges & J.A. Smith (eds), ESCOM, Leiden, 1994, ppl 13). 

As illustrated in Figure 1, cyclization occurs upon activation of the free resin- 
5 bound carboxy group of Asp by the coupling reagent, HBTU, over 5 hours. Cyclization 
is through the reaction of the methionines' free a-amino group with the aspartic acids* 
carboxyi ends. The uncyclized peptides are then blocked by biotin under standard 
reaction conditions. The resin bound cyclic library mixture is then biotinylated in the 
following manner: biotin (0.8 mmol) was dissolved in a 1 :1 mixture of DMSO 

10 containing 0.8 mmol each of N-methylmorpholine and 1-benzotriazolyloxy-tris- 
dimethylamio-phosphonium hexafluorophosphate/0.65M diisopropylethylamine 
containing 0.8 mmol N-hydroxybenzotriazole and allowed to react with the resin-bound 
cyclized library overnight at room temperature. Final deprotection of the peptides and 
their release from the resin by TFA gave 33 mg of the crude peptides as a white solid. 

1 5 The TFA cleavage conditions were: 2 ml out of mixture of 1 0 ml TFA, 0.4 ml water, 0.5 
ml thioanisol, 0.75 g phenol and 0.25 ml ethanol diphiol, room temperature for 3 hours. 
The MALDI-TOF-MS (Matrix-Assisted-Laser-Desorption-Ionization Time-of-Flight 
Mass Spectrometry Mass Spectrum) (on Perseptive Biosystem Voyager Linear System) 
showed the expected major broad peak (@ MW 1778) and three other minor species (@ 

20 MW 3623, 6503, 7352 respectively). 

EXAMPLE 2 ; Purification of the Cyclic Peptide Library by 
Avidin-Biotin Aflinity Chromatography 

25 After the on-resin intra-molecular cyclization reaction, the uncyclized peptides 

with free amino terminals were derivatised with biotin (described in Example 1). 
Purification of the cyclic peptide library, illustrated in Figure 2A, was performed on 
immobilized avidin following established procedures. (Quesnel, A. (1995) Analytical 
Biochemistry 23i:l 82-1 87). The unwanted biotinylated linear peptides were retained in 

30 the avidin column. Briefly, the crude mixture was dissolved in phosphate buffer (PBS, 
10 mM phosphate, 150 mM NaCl, 5mM KCl, pH 7.2) and applied to a column of 
immobilized avidin (SIGMA A-9207; insolubilized on cross-linked 6% beaded agarose; 
10 ml suspension with 3.6 mg avidin per ml of packed gel; packed in 10 X 1 cm 
colunm;) with a flow rate of 0.6 ml/min. The column was washed with 10-15 bed 

35 volumes of phosphate buffer at 4°C. The eluents were collected and dried and the 
peptide mixture desalted on a Sephadex G-25 (1 .5 X 85 cm) column using water as 
eluent. Peptide elution was detected by monitoring absorbance @ 280 nm. 1 ml 



wo 98/54577 



PCT/US98/10876 



-30- 

fractions were collected and those containing peptide were pooled and lyophilized to 
yield ca. 7.0 mg of white solid. 

The initial products of the on-resin cyclization approach described in Fig. 1 were 
characterized by matrix-assisted laser-desorption ionization time-of-flight mass 
5 spectroscopy (MALDI-TOF-MS) using a Perspective Biosystem Voyager Linear System 
instrument. This revealed the expected broad peak at Mw. 1 778, and three other minor 
species at Mws. 3623, 6503, and 7352 (Figure 2B). These latter peaks are not cyclic, 
and result from dimerization and oligomerization reactions between individual resin- 
bound chains during the cyclization process. Following cyclization, the unwanted linear 

10 peptides and oligomers were removed by derivitization of the library mixture with 

biotin, followed by chromatography on an immobilized avidin column. The unwanted 
biotinylated peptides were retained, while the desired cyclic library product was eluted 
with phosphate-buffered saline. The eluents were subsequently desalted on a Sephadex 
G-25 colunm using water as the eluent. Mass spectroscopy (MALDI-TOF-MS) showed 

1 5 a single peak of Mw at 1 7 1 5 (Figure 2C), with successful removal of the three 
contaminants of higher weight by this technique. 
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The purified cyclic library product was then subjected to cleavage using CNBr 
and the resulting mixture of linear peptides was subjected to automated peptide 
sequencing (Edman degradation followed by standard 2-buffer column chromatography) 
using an Applied Biosystems Model 477A peptide sequencer. Tables I and II show the 
5 recovery of each amino acid during 15 cycles of sequencing, and the mole-percentages 
of each amino acid at each cycle. The large values of Asp and Thr in the first few cycles 
are artifactual, resulting fi*om sample and buffer impurities. As expected fi-om the CNBr 
cleavage following methionine, the dominant amino acid in the first cycle is alanine, the 
predominant amino acid cycle in 6 is the orienting residue Tyr, and the predominant 
10 anwno acids at cycles 11-14 correspond to the sequence Ala-Arg-Arg-Asn (see Figure 1). 

EXAMPLE 3 : Use of Cyclic Peptide Libraries to Identify 
Optimal Protease Substrates 

1 5 A cyclic peptide library was incubated with the protease of interest under 

reaction conditions where cleavage proceeded to a final end-state (e.g., no fiirther cyclic 
peptide molecules are capable of serving as substrate for the protease), and the progress 
of the reaction monitored over time. Based on this information, a time point was chosen 
that corresponded to cleavage of only 2-20% of the library mixture, effectively selecting 

20 for the best protease substrates under the limiting conditions. The protease reaction was 
stopped, and the resulting mixture was then subjected to standard amino acid 
sequencing. The uncleaved cyclic molecules, lacking a fi-ee amino-terminus, were 
unable to participate in the Edman degradation reaction used for sequencing, and 
consequently did not contribute to the sequencmg result Similarly, because the protease 

25 of interest acted catalytically, and was present at <1 : 1 00 molar ratio compared with the 
cyclic peptide library, it was not necessary to remove the protease firom the cleaved 
cyclic peptide products prior to sequencing (although this can easily be performed, if 
necessary, using standard size-exclusion chromatography). 

As shown in Figure 4, 500 figs of a cyclic peptide library (sequence 

30 MAXXXXYXXXXARRN, cyclized head-to tail via a M-N peptide bond) was reacted 
with 5 figs of bovine chymotrypsin in phosphate-buffered saline solution at room 
temperature for up to 120 min. Aliquots were removed at progressive time points, and 
protease-mediated cleavage of the cyclic library monitored by the appearance of free 
amino terminii as measured by reaction with ninhydrm. This analysis revealed complete 

35 cleavage of the cyclic peptide library within 40 minutes, with a linear rate of cleavage 
during the first 10 minutes. To determine the optimal protease substrate sequence, a 
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second reaction was then performed for 8 minutes, at which point approximately 20% of 
the cyclic peptide library would have been cleaved. The protease reaction was 
terminated by addition of 1 mM phenylmethyl sulfonyl fluoride with heating to lOO^C, 
and the entire mixture subjected to automated peptide sequencing on an Applied 
5 Biosystems Model 477A sequencer. 

The resulting sequences in Table III revealed a marked excess of Tyr in j>osition 
15, confirming that cleavage occurred following this aromatic residue, as expected from 
the well-known specificity of chymotrypsin. To determme the optimal substrate 
sequence, the relative abundance of amino acids in positions flanking the tyrosine 

10 residue (e.g., cycles 11-14 which correspond to residues N-terminal to the tyrosine 
cleavage site, and cycles 1-4 which correspond to residues C-terminal to the tyrosine 
cleavage site), were compared with the abundance of each amino acid in the starting 
cyclic library mixture, based on amino acid sequencing following CNBr cleavage As 
shown in Figures 3A, 33 and 3C, this revealed a preference for Arg, His and Pro in the 

15 Tyr-2 position. Pro or Asn in the Tyr-1 position, and Ala, Lys or Val in the Tyrf 1 
position (See also. Table II). These findings are in agreement with the preferred 
substrate specificity for chymotrypsin based on studies using synthetic linear di-and 
tripeptide analogues and confirm the applicability of this technique, although certain 
differences were found using the cyclic libraries, as compared to studies using linear 

20 substrates 

The substrate specificity for chymotrypsin has been studied by synthesizing 
linear peptides with C terminal chromophoric groups that are released to measure 
enzyme activity. Hence, there are no amino acids C terminal to the scissile bond, and 
the amino terminal residue is often reacted with a blocking group which can alter 

25 enzyme activity. Comparison of the preferred substrate specificity for chymotrypsin, 
based on studies using these linear di- and tripeptide analogues (Toszer et al. Acta 
Biochim. Biophys. Hung. 21:3351-348 (1986); Schellenberger et al, Eur. J. Biochem. 
199:623-636 (1991)) shows both similarities and differences in the information obtained 
from cyclic vs. linear substrates, (see Table IV), demonstrating that a different subset of 

30 information can be obtained from cyclic library screening that could not be derived by 
linear peptide screening. Table IV compares the substrate specificity found with the 
ODCPL technique with several commercially available substrates for chymotrypsin. 
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TABLE IV 



Substrate Specificity for Chvmotrvpsin Assayed Using 
A Cyclic Peptide Library and Commercially Ayailable Synthetic Chymotrvpsin 
5 Substrates Based On Optimal Cleanage Of Linear Peptides 



Residue or Group 



Cyclic Library 



Synthetic Substrate 1 



Synthetic Substrate 2 Succ 



Sjmthetic Substrate 3 Ac 



Synthetic Substrate 4 



eYi3 2Y:2 gYA Tyr eY±L 



R/H/P P/N 



Succ 



Succ 



Y 
F 
F 
F 



A/K/W 

MCA 

MCA 

pNA 

pNA 



10 



pNA = para-nitroanilide, which absorbs light after release. 
MCA = (7-methoxycoumarin-4-yl) acetylmethoxysuccinyl, which fluoresces 
after release. 



EXAMPLE 4 : Use of Cyclic Peptide Library to Determine 

Protein Tyrosine Kinase Specificity 

15 A Cyclic Peptide Library oriented around a fixed tyrosine residue was reacted 

with the protein tyrosine kinase src, in the presence of unlabelled and trace amounts of 
32p-y,ATP untiM% of the peptides were radiolabelled. Enzyme and ATP were 
removed by DEAE column chromatography, and unphosphorylated peptides separated 
fi-om phosphorylated peptides using a ferric chloride column. The purified 

20 phosphorylated cyclic peptides were cleaved by CNBr, and then subjected to peptide 
sequencing using an Applied Biosystems Model 477A sequencer. 
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TABLEV 

Substrate Specificity for the Protein Tyrosine Kinase src Assayed using the Cyclic 
Peptide Library as Substrate 

pY-4 pY-3 pY-2 pY-1 pTvr pY+1 pY+2 pY+3 pY+4 
X T(L6) T(3.6) E(1.4) G(1.5) E(1.6) E(1.7) X 

G(1.4) T(1.4) G(1.6) 0(1.4) 

E(1.3) 8(1.4) 

5 

Following cyclic library phosphorylation, DEAE and FeCl3 chromatography^ and 
amino acid sequencing, the relative amino acid preferences were determined by 
normalizing the mole-percentage of each amino acid at each cycle from the Tyr- 
phosphorylated product to the mole-percentages present in the original un- 

10 phosphorylated cyclic library mixture. The results revealed a preference for Thr in 

position pTyr-2, as well as strong preferences for Glu and Gly in positions pY-1, pY+1, 
pY+2 and pY+3 as shown in Table V. 

The sequences obtained with the cyclic library are compared with the predicted 
kinase substrate sequence obtained with a linear library (Songyang, et al.. Nature 

15 i7i:536-539, (1995)) in Table VI. The preferred amino acids were similar at pY+1 and 
pY+2, but differences were also observed at other positions. At pY-1 , the preferred 
amino acids obtained with the cyclic library were Glutamate and Aspartate, while the 
linear library selected preferred amino acids with distinctly different, hydrophobic side 
chains. Valine, Isoleucine and Leucine. At pY+3, the preferred amino acids obtained 

20 with the cyclic library were Glutamate, Aspartate, and Serine, while the linear library 
selected Phenylalanine and Isoleucine. Thus, the results with src fiuther demonstrate 
that a different subset of information can be obtained using cyclic libraries as opposed to 
linear libraries. 

25 TABLE VI 

Substrate Specificity for the Protein Tyrosine Kinase src Assayed using the Cyclic 
Acid Linear Peptide Libraries as Substrates 

EY:3 eY:! eIXL pY+I nY+2 dY+S 

Cyclic 11^ G/T/E E/G E/G/S 

Linear E/D E^ INfL E m 
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EXAMPLE 5 ; Combining Cyclic Peptide Library Screening 

of Proteases with Lipid Encapsulation 



In this example, lipid encapsulation of proteases was combined with screening 
5 for optimal protease substrates using cyclic peptide libraries. By encapsulating the 
protease within a liposome prior to peptide library screening, only those cyclic peptides 
which are permeable to lipid bilayers (and therefore good drug candidates and candidate 
lead compounds) can enter the liposome and be cleaved by the protease. 

A protease of interest is first encapsulated in liposomes using standard and 

10 previously published techniques (see e.g., G. Gregoriadis (1 976) Methods Enzymol. 

44:218-227). Following this, the liposome encapsulated protease is then incubated with 
the cyclic peptide library under conditions where 1 0-20% of the maximal cleavage (end 
state) occurs, as described in Example 3. The final mixture (cleaved and uncleaved 
peptides) is then inmiediately subjected to standard amino acid sequencing. Since the 

1 S Edman reaction used in sequencing can only be performed when peptide molecules have 
a free amino-terminus (e.g., linear peptides), all of the uncleaved cyclic molecules are 
"invisible" and the resulting sequenced mixture uniformly identified only the optimal 
cyclic substrates for the protease. 

Consequently, by combining the cyclic library screening with the lipid 

20 encapsulation of the enzymes, the search for cyclic-peptide-based anti-protease drug 
candidates is converted into a one-step screen that incorporates both optimal substrate 
selection and membrane permeability. Although described herein with regard to 
proteases, this approach of lipid encapsulation preceding cyclic library screening can 
also be used to identify the optimal cyclic peptide substrates of other enzymes that act on 

25 peptide substrates, including, but not limited to, kinases, phosphatases, 
methyltransferases, ubiquitinating enzymes and peptidyl-transferases. 



EXAMPLE 6 i Use of Cyclic Peptide Libraries to Screen for 

Epitopes for Monoclonal Antibodies 

30 

Epitopes for antibodies (e.g., monoclonal antibodies) can be identified using an 
approach that takes advantage of linear peptide libraries to identify antibody epitopes 
comprising linear arrangements of particular amino acids and cyclic peptide libraries to 
identify antibody epitopes that comprise complex molecular surfaces. The screening 
35 procedure involves preparing immobilized monoclonal antibodies, incubating the 

immobilized antibodies with linear or cyclic peptide libraries, rapid removal of unbound 
library components, elution of the specific bound library peptides, followed by amino 
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acid sequencing. During the initial screening process, if one component of the presumed 
epitope is known (e.g., phosphotyrosine or phosphothreonine), then screening is 
performed using libraries fixed around this invariant residue. If the epitope is 
completely unknown, then the screening is performed using a panel of 20 oriented 
5 libraries, each with a different amino acid as the fixed residue. (This approach both 
selects for key residues in the epitope and also provides confirmation once the final 
epitope sequence is deduced.) Once the epitope motif is established, a search of a 
protein sequence database (e.g., Genbank) can be performed to identify proteins that 
contain the epitope motif, thereby allowing for the identification of putative proteins that 
10 might cross-react with that monoclonal antibody. Applications of this approach include 
identifying epitopes for diagnostic and therapeutic monoclonal antibodies, and 
identifying the molecular targets of auto-antibodies in patients with autoimmune 
diseases. 

1 S EXAMPLE 7 : Generation of Diagnostic and Therapeutic Monoclonai 

Antibodies Using Cyclic Peptide Library Immunogens 

The example describes an approach that can be used to generate a large panel of 
monoclonal antibodies that can then be rapidly examined for function, v^dthout having to 

20 establish the target protein of interest prior to antibody generation. Following 
identification of functional clones, epitope mapping and characterization of the 
molecular target is conducted as described in Example 6. This approach involves 
immunizing an animal with a library of peptides, followed by preparation of a panel of 
monoclonal antibodies from the immunized animal and selection of those monoclonal 

25 antibodies that have a desired functional property. 

As an example, to generate therapeutic antibodies against signalling systems that 
mediate their effects using phosphoserine proteins, a phosphoserine library is coupled to 
keyhole limpet hemocyanin (KLH) and the library immunogen is used to inoculate mice. 
Using a library of 10*7 peptides, perhaps -10^ will generate an antibody response. 

30 Splenocytes from immunized mice are then fused to hybridoma cells to produce 

monoclonal antibodies using standard techniques. These antibodies are then screened 
for the ability to interfere with the signalling reaction of interest (e.g., functional clones 
are identified) and the relevant clones are selected and expanded. Following selection, 
the epitopes of the selected antibodies can be mapped, and the native molecular targets 

35 characterized, using the approaches described in Example 6. 
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EXAMPLE 8 : Tag Termination Technique (TTT) for Duplex Screening of 
Linear 

and Cyclic Libraries to Identify Optimal Chain Length or Ring 

Size 

5 

In the tag termination technique (TTT), a portion of the growing peptide chains 
of the Hbrary is removed bom the synthesizer at different chain lengths. Each portion is 
"coded" by introduction of a series of unique chemical tags whose cleavage can be 
performed prior to opening the ring or disrupting the linear peptide bond(s). Thus, a 

1 0 library is created that contains library members of different chain lengths (for Imear 
peptides) ring sizes (for cyclic peptides), each being coded with a different tag. The 
tagged library is used to screen a binding protein of interest and librar)' members are 
selected. Following screening, the selected library members are decoded by tag removal 
and analyzed as to their size to thereby identify the optimal size library for the binding 

1 5 protein of interest. The optimal sized library is then further screened using the standard 
library approach to determine the optimal binding motif of the binding protein of 
interest. 

For example, one type of tag would involve the addition, to various portions of 
the growing peptide chains during synthesis, of a glutaraate or aspartate residue 

20 containing alkyl- or aliphatic alcohol-thioester tags of varying chain lengths. The length 
of the aliphatic chain of die tag is varied as the length of the library chains increases 
during synthesis. Following synthesis, the double degenerate library (length and 
sequence) is used to screen for target binding. The bound library peptides are eluted, the 
thioester of the tag is cleaved and the alkyl chain is separated from the peptides by 

25 reverse phase-HPLC or gas chromatography/mass spectroscopy. The peak in alkyl chain 
lengdi identifies the optimum length or ring size for the library for the particular binding 
protein being analyzed. This optimal length library is then cleaved to remove the alkyl 
thioester and the library, free from the tag, is then used to screen for binding using the 
standard peptide library approach to identify the optimal substrate binding motif for the 

30 binding protein of interest 



EQUIVALENTS 



Those skilled in the art will recognize, or be able to ascertain using no more than 
35 routine experimentation, many equivalents to the specific embodiments of the invention 
described herein. Such equivalents are intended to be encompassed by the following 
claims. 
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CLAIMS 

What is claimed is: 

5 1. A method for deteraiining an amino acid sequence motif for an interaction site of 
a binding compound, comprising: 

contacting an oriented degenerate cyclic peptide library (ODCPL) with a binding 
compound under conditions which allow for interaction between the binding compound 
10 and the ODCPL, wherein the ODCPL comprises library members having an identifiable 
amino acid residue at a fixed non-degenerate position; 

allowing the binding compound to interact with the ODCPL such that a complex 
is formed between the binding compound and a subpopulation of library members 
1 5 capable of interacting with the binding compound; 

separating the subpopulation of library members capable of interacting with the 
binding compound from library members that are incapable of interacting with the 
binding compoimd; 

20 

linearizing the subpopulation of library members capable of interacting with the 
binding compound to form a subpopulation of linearized library members; 

determining the amino acid sequence of the subpopulation of linearized library 
25 members; and 

determining an amino acid sequence motif for an interaction site of the binding 
compound, based upon relative abimdance of different amino acid residues at each 
degenerate position within the linearized library members. 

30 

2. The method of claim 1 , wherein the ODCPL is a soluble synthetic peptide 
library. 



35 



3. The method of claim 1 , wherein the ODCPL comprises cyclic peptides 
comprising a formula: 
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wherein Z^a is a non-degenerate natural or unnatural a-amino acid, Xga is any 
natural or unnatural a-amino acid, R-R' is a dipeptide specifically cleavable under 
5 cleavage conditions to allow for linearization of the peptide, and n and m are each 

independently selected from 0-10 inclusive, with the proviso that if n is 0, m is selected 
from l-IO inclusive and if m is 0, n is selected from 1-10 inclusive. 

4. The method of claim 3, wherein R-R' is methionine-alanine. 

10 

5. The method of claim 4, wherein library members are linearized by cleaving R-R* 
with cyanogen bromide. 

6. The method of claim 1, wherein the binding compound is a kinase. 

15 

7. The method of claim 6. wherein the ODCPL comprises cyclic peptides 
comprising a formula: 




wherein Z^a is a non-degenerate phosphorylatable amino acid selected from the 
group consisting of serine, threonine and tyrosine, X^a is any natural or imnatural a- 
amino acid, R-R' is a dipeptide specifically cleavable under cleavage conditions to allow 
for linearization of the peptide, and n and m are each independently selected from 0-10 
25 inclusive, with the proviso that if n is 0, m is selected from 1-10 inclusive and if m is 0, 
n is selected from 1-10 inclusive. 
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8. The method of claim 7, wherein Zaa is the only phosphorylatable amino acid 
within the cyclic peptides. 

5 9. The method of claim 6, wherein the protein kinase is a protein-serine/threonine 
specific kinase and the ODCPL comprises cyclic peptides comprising a formula: 




10 wherein is a non-degenerate phosphorylatable amino acid selected from the 

group consisting of serine and threonine, X^a is any natural or unnatural a-amino acid, 
R-R* is a dipeptide specifically cleavable under cleavage conditions to allow for 
linearization of the peptide, and n and m are each independently selected from 0-10 
inclusive, with the proviso that if n is 0, m is selected from 1-10 inclusive and if m is 0, 

IS n is selected from 1-10 inclusive. 

10. The method of claim 6, wherein the protein kinase is a protein-tyrosine specific 
kinase and the ODCPL comprises cyclic peptides comprising a formula: 




wherein Z^^ is tyrosine, Xg^ is any natural or unnatural a-amino acid, R-R* is a 
dipeptide specifically cleavable under cleavage conditions to allow for linearization of 
the peptide, and n and m are each independently selected from 0-10 inclusive, with the 
25 proviso that if n is 0, m is selected from 1-10 inclusive and if m is 0, n is selected from 
I -10 inclusive. 
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1 1 . The method of claim 6, wherein the kinase phosphorylates the subpopulation of 
library members capable of interacting with the kinase and then the subpopulation of 
phosphorylated librar>' members is separated from nonphosphoryiated library members. 

5 

12. The method of claim 1 1 , wherein the subpopulation of phosphorylated library 
members is separated from nonphosphoryiated library members by binding the 
subpopulation of phosphorylated library members to a ferric colunm or to an anti- 
phosphotyrosine antibody column. 

10 

13. The method of claim 1, wherein the binding compound is a phosphatase. 

14. The method of claim 13, wherein the ODCPL comprises cyclic peptides 
comprising a formula: 

15 




wherein Z^a is a non-degenerate phosphorylated amino acid selected from the 
group consisting of phosphoserine, phosphothreonine and phosphotyrosine, Xaa is any 
20 natural or unnatural a-amino acid, R-R* is a dipeptide specifically cleavable under 
cleavage conditions to allow for linearization of the peptide, and n and m are each 
independently selected from 0-10 inclusive, with the proviso that if n is 0, m is selected 
from 1-10 inclusive and if m is 0, n is selected from 1-10 inclusive. 

25 15. The method of claim 1 4, wherein Zaa is the only phosphorylated amino acid 
within the cyclic peptides. 

16. The method of claim 14, wherein the phosphatase dephosphorylates the 
subpopulation of library members capable of interacting with the phosphatase and then 
30 the subpopulation of dephosphorylated library members is separated from 
phosphorylated library members. 
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1 7. The method of claim 1 6, wherein the subpopulation of dephosphorylated library 
members is separated from phosphorylated library members by binding the 
phosphorylated library members to a ferric column or to an anti-phosphotyrosine. 

5 

18. The method of claim 1, wherein the binding compound comprises an SH2 
domain. 

19. The method of claim 18, wherein the ODCPL comprises cyclic peptides 
1 0 comprising a formula: 




wherein Zaa is a non-degenerate phosphotyrosine residue, Xga is any natural or 
15 unnatural a-amino acid, R-R* is a dipeptide specifically cleavable under cleavage 

conditions to allow for linearization of the peptide, and n and m are each independently 
selected from 0-10 inclusive, with the proviso that if n is 0, m is selected from 1-10 
inclusive and if m is 0, n is selected from 1-10 inclusive. 

20 20. The method of claim 1 8, wherein the binding compoimd comprising an SH2 
domain is immobilized on a solid support, the ODCPL is passed over the solid support 
to allow the binding compound to interact with a subpopulation of library members 
capable of interacting with the SH2 domain, and library members incapable of 
interacting with the SH2 domain are washed away to thereby separate the subpopulation 

25 of library members capable of interacting with the SH2 domain from library members 
incapable of interacting with the SH2 domain. 

21 . The method of claim 1 , wherein the binding compound comprises an SH3 
domain. 

30 
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22. The method of claim 21 , wherein the ODCPL comprises cyclic peptides 
comprising a formula: 




wherein Z^a is a non-degenerate proline, X^a is any natural or unnatural a-amino 
acid, R-R* is a dipeptide specifically cleavable under cleavage conditions to allow for . 
linearization of the peptide, and n and m are each independently selected from 0-10 
inclusive, with the proviso that if n is 0, m is selected from 1-10 inclusive and if m is 0, 
10 n is selected from 1-10 inclusive. 

23 . The method of claim 2 1 , wherein the binding compound comprising an SH3 
domain is inmiobilized on a solid support, the ODCPL is passed over the solid support 
to allow the binding compound to interact with a subpopulation of library members 
15 capable of interacting with the SH3 domain, and library members incapable of 

interacting with the SH3 domain are washed away to thereby separate the subpopulation 
of library members capable of interacting with the SH3 domain from library members 
incapable of interacting with the SH3 domain. 

20 24. The method of claim 1 , wherein the binding compound is a protease and the 

subpopulation of library members capable of interacting with the protease are linearized 
by allowing the protease to cleave the library members. 

25. The method of claim 1 , wherein the binding compoimd is an antibody, or 
25 antigen-binding fragment thereof. 

26. The method of claim 1, wherein the binding compound comprises a domain 
selected from the group consisting of WW domains, PTB domains, PDZ domains, LIM 
domains, pleckstrin homology domains and zinc finger domains. 
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27. The method of claim 1 , wherein the binding compound is selected from the 
group consisting of extracellular growth factors, growth factor receptors, adhesion 
molecules, intercellular signaling molecules, lipid phosphatases, 7-transmembrane 
receptor proteins, proteases, ion channels, methyltransferases, ubiquitinating enzymes 

5 and peptidyl-transferases. 

28. The method of claim 1 , wherein the binding compound is encapsulated in a 
liposome prior to contacting the binding compound with the ODCPL. 

10 29. The method of claim 1, wherein in the ODCPL contains at least 1 x 10^ unique 
peptides. 

30. The method of claim 1 , wherein in the ODCPL contains at least 1x10^ unique 
peptides. 

15 

31. The method of claim 1, wherein in the ODCPL contains at least 1 x 10^0 unique 
peptides. 

32. The method of claim 1 , wherein each unique peptide in the ODCPL is present at 
20 an amount less than about 1 finol. 

33. The method of claim 1, wherein each unique peptide in the ODCPL is present at 
an amount less than about O.S fmol. 

25 34. The method of claim 1 , wherein each unique peptide in the ODCPL is present at 
an amount less than about COS fmol 

35. The method of claim 1 , wherein the subpopulation of library members contains 
at least 1 x 10^ different peptides. 

30 

36. The method of claim 1 , wherein the subpopulation of library members contains 
at least 1 x 10^ different peptides. 

37. The method of claim 1 , wherem the subpopulation of library members contains 
35 at least 5 x 1 0^ different peptides. 
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38. The method of claim 1 , wherein the subpopulation of library members represents 
less than 100 nanomoles of total peptides, 

39. The method of claim 1 , wherein the subpopulation of library members represents 
S less than 10 nanomoles of total peptides. 

40. The method of claim 1 , wherein the subpopulation of library members represents 
less than 1 nanomole of total peptides. 

10 41. An oriented degenerate cyclic peptide library (ODCPL), which comprises cyclic 
peptides comprising a formula: 




1 5 wherein Z^a is a non-degenerate natural or unnatural a-amino acid, X^a is any 

natural or unnatural a-amino acid, R-R* is a dipeptide specifically cleavable under 
cleavage conditions to allow for linearization of the peptide, and n and m are each 
independently selected from 0-10 inclusive, with the proviso that if n is 0, m is selected 
from 1-10 inclusive and if m is 0, n is selected from 1-10 inclusive. 

20 

42. The ODCPL of claim 41 , wherein R-R* is alanine-methionine. 

43. The ODCPL of claim 41, wherein is a non-degenerate phosphorylatable 
amino acid selected from the group consisting of serine, threonine and tyrosine. 

25 

44. The ODCPL of claim 41 , wherein Z^a is a non-degenerate phosphorylated amino 
acid selected from the group consisting of phosphoserine, phosphothreonine and 
phosphotyrosine. 

30 45. The ODCPL of claim 44, wherein is phosphotyrosine. 
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46. The ODCPL of claim 41 , wherein Zaa is proline. 

47, The ODCPL of claim 41, wherein Z^a is a nonnatural a-amino acid. 

5 48. The ODCPL of claim 41 , wherein Z^a is a hydrophobic natural or nonnatural 
a-amino acid. 

49. The ODCPL of claim 4 1 , wherein Z^a is a hydrophilic natural or nonnatural 
a-amino acid. 

10 

50. A method for purifying cyclic peptides from linear peptides comprising: 
providing a mixture of cyclic peptides and linear peptides; 

contacting the mixture with a blocking agent that reacts with the free amino 
termini of the linear peptides to form amino-protected linear peptides; 
1 5 contacting the mixture with a binding agent that is capable of interacting with the 

amino-protected linear peptides but incapable of interacting with the cyclic peptides; and 

separating the amino-protected linear peptides from the cyclic peptides to thereby 
purify the cyclic peptides. 

20 51. The method of claim 50, wherein the blocking agent is biotin, 

52. The method of claim 5 1 , wherein the binding agent is avidin or streptavidin. 

53. A method for determining an amino acid sequence motif for an interaction site of 
25 a protease, comprising: 

contacting an oriented degenerate cyclic peptide library (ODCPL) with a protease 
under conditions which allow for interaction between the protease and the ODCPL, 
wherein the ODCPL comprises library members having an identifiable amino acid 
30 residue at a fixed non-degenerate position; 

allowing the protease to interact with the ODCPL such that a complex is formed 
between the protease and a subpopulation of library members capable of interacting with 
the protease; 

35 

linearizing the subpopulation of library members capable of interacting with the 
protease to form a subpopulation of linearized library members; 
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determining the amino acid sequence of the subpopulation of linearized library 
members; and 

5 determining an amino acid sequence motif for an interaction site of the protease, 

based upon relative abundance of different amino acid residues at each degenerate 
position within the Hnearized library members. 

54. The method of claim 53, wherein said protease is chymotrypsin. 

10 

55. The method of claim 53, wherein said protease is from the serine protease family, 
cysteine protease family, metal loproteinase family, aminopeptidase family or 
carboxypeptidase family. 
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